FIGURE 6.6
Detailed architectures of the 1-bit networks implemented by us: (a) detailed architecture of 1-bit PointNet, where MM denotes matrix multiplication; (b) detailed architecture of 1-bit PointNet++, where Cat denotes the concatenation operation; (c) detailed architecture of 1-bit DGCNN; (d) detailed architecture of the FC unit and the Bi-FC unit used in (a) to (c). Two BNs are used in the Bi-FC unit.
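To make the Bi-FC unit concrete, the following is a minimal PyTorch sketch of one plausible realization: a sign binarizer with a straight-through estimator, a binary linear layer with a channel-wise scale, and two BNs as in the caption. The layer ordering, the scale parameter `alpha`, and all names are illustrative assumptions, not the exact design in Fig. 6.6.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class BinarizeSTE(torch.autograd.Function):
    """sign() in the forward pass; straight-through estimator in the backward pass."""
    @staticmethod
    def forward(ctx, x):
        ctx.save_for_backward(x)
        return torch.sign(x)

    @staticmethod
    def backward(ctx, grad_out):
        (x,) = ctx.saved_tensors
        return grad_out * (x.abs() <= 1).float()  # pass gradients only inside [-1, 1]

class BiFC(nn.Module):
    """Hypothetical Bi-FC unit: BN -> sign -> binary linear with scale -> BN."""
    def __init__(self, in_features, out_features):
        super().__init__()
        self.bn_in = nn.BatchNorm1d(in_features)      # first of the two BNs
        self.weight = nn.Parameter(torch.randn(out_features, in_features) * 0.01)
        self.alpha = nn.Parameter(torch.ones(out_features))  # channel-wise scale
        self.bn_out = nn.BatchNorm1d(out_features)    # second BN

    def forward(self, x):
        x = BinarizeSTE.apply(self.bn_in(x))          # binarize activations
        w = BinarizeSTE.apply(self.weight)            # binarize weights
        return self.bn_out(F.linear(x, w) * self.alpha)

# Example: a batch of 32 feature vectors, dimension 64 -> 128.
y = BiFC(64, 128)(torch.randn(32, 64))
```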
Updating $p_i$: We finally update the other parameters $p_i$, with $w_i$ and $\alpha_i$ fixed. $\delta_{p_i}$ is defined as the gradient of $p_i$. We formulate it as

$$\delta_{p_i} = \frac{\partial L_S}{\partial p_i}, \tag{6.63}$$

$$p_i \leftarrow p_i - \eta \, \delta_{p_i}. \tag{6.64}$$
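As a concrete illustration of Eqs. 6.63 and 6.64, the sketch below performs this plain gradient step in PyTorch; `supervised_loss`, `p`, and the learning rate `eta` are placeholders introduced here, not names from POEM.

```python
import torch

# p stands for the remaining non-binarized parameters, updated while
# w_i and alpha_i are held fixed; all names here are placeholders.
p = torch.randn(256, requires_grad=True)
eta = 0.01                         # learning rate (eta in Eq. 6.64)

def supervised_loss(p):
    # Stand-in for L_S: any differentiable loss works for this sketch.
    return (p ** 2).sum()

loss = supervised_loss(p)          # L_S
loss.backward()                    # delta_{p_i} = dL_S / dp_i    (Eq. 6.63)
with torch.no_grad():
    p -= eta * p.grad              # p_i <- p_i - eta * delta_{p_i} (Eq. 6.64)
    p.grad.zero_()
```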
The above derivations show that POEM can be trained with the back-propagation (BP) algorithm. POEM is supervised by a simple and effective reconstruction loss function. Moreover, we introduce an efficient Expectation-Maximization algorithm to optimize the unbinarized weights, constraining them to form a bimodal distribution.
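To make the EM step concrete, the sketch below fits a two-component 1-D Gaussian mixture to a layer's flattened latent weights with a few EM iterations; the fitted means could then anchor a bimodal penalty. The function name, the initialization, and the final penalty are illustrative assumptions, not POEM's exact algorithm.

```python
import torch

def em_two_modes(w, iters=10):
    """Fit a two-component 1-D Gaussian mixture to flattened weights via EM.

    Illustrative only: an assumed form of the E/M updates, not POEM's exact steps.
    """
    w = w.flatten()
    mu = torch.stack([-w.abs().mean(), w.abs().mean()])  # init modes near +/- E|w|
    sigma = w.std() * torch.ones(2)                      # shared initial spread
    pi = torch.tensor([0.5, 0.5])                        # mixing weights
    for _ in range(iters):
        # E-step: responsibility of each mode for every weight (log-domain softmax)
        logp = (-0.5 * ((w[:, None] - mu) / sigma) ** 2
                - sigma.log() + pi.log())
        r = torch.softmax(logp, dim=1)                   # shape (N, 2)
        # M-step: re-estimate means, spreads, and mixing weights
        nk = r.sum(0)
        mu = (r * w[:, None]).sum(0) / nk
        sigma = (((w[:, None] - mu) ** 2 * r).sum(0) / nk).sqrt().clamp_min(1e-6)
        pi = nk / w.numel()
    return mu

w = torch.randn(1024)              # placeholder latent (unbinarized) weights
modes = em_two_modes(w)
# A bimodal constraint could then pull each weight toward its nearer mode:
penalty = ((w[:, None] - modes) ** 2).min(dim=1).values.mean()
```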
6.3.5 Ablation Study
Hyper-parameter selection: The hyper-parameters λ and τ in Eqs. 6.44 and 6.58 control the reconstruction loss and the EM algorithm, respectively. We evaluate the effect of λ and τ on ModelNet40 with 1-bit PointNet, whose architectural details are shown in Fig. 6.6 (a). The Adam optimizer is used during training, with a batch size of 592. Table 6.2 reports the performance of POEM under different values of λ and τ: from left to right, the columns give the overall accuracies (OAs) for λ ranging from 1×10−3 to 0, and from top to bottom, the rows give the OAs for τ ranging from 1×10−2 to 0. As λ decreases, the OA first increases and then drops dramatically. The same trend is shown when we